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We show that weighted unitary 2-designs define optimal measurements on the 
system-ancilla output state for ancilla-assisted process tomography of unital quan- 
tum channels. Examples include complete sets of mutually unbiased unitary-operator 

^!~ ' bases. Each of these specifies a minimal series of optimal orthogonal measurements. 

^^ ■ General quantum channels are also considered. 

(N 

^ . 1. INTRODUCTION 

(N 

Fundamental to the fabrication of quantum information processing devices [1], such as 

^ ■ quantum teleporters, key distributers, doners, gates, and indeed, quantum computers, is the 

I ■ ability to precisely determine an unknown transformation on a quantum system. Quality 

^ , assurance requires a complete characterization of these devices, which can be accomplished 

^ I through a procedure known as quantum process tomography [2]: for judicious choices of initial 

qh' system states, the transformation is uniquely identified by the outcomes of measurements 

on the transformed states. 
^ ! The approach of ancilla-assisted quantum process tomography [3, 4] is to encode all infor- 

^ I mation about the transformation into a single bipartite system-ancilla quantum state, and 

thus completely reduce the problem to that of quantum state tomography [2]. A sequence 
of measurements on identically prepared copies of this state will then reveal the particular 
transformation under examination. It is known that, for linear tomographic reconstruc- 
tions of general quantum states, the most robust measurements against statistical error 
are described by tight informationally complete positive-operator-valued measures (tight 
O ■ IC-POVMs) [5]. Such measures derive their name from a related concept in frame theory, 

called a tight frame [6], and are equivalent to weighted complex projective 2-designs [5, 7-9]. 
k>( ; The nonselective evolution of an open quantum system is described by a completely 

Vh ' positive, trace preserving, linear transformation on quantum states. Such transformations 

^ are called quantum channels within the context of quantum information theory [1], as they 

also describe the degradation of information encoded in quantum states after transmission 
through a noisy communication channel. A unital quantum channel is one which fixes the 
maximally mixed state. These include all probabilistic applications of unitary operators, 
and thus, within the context of process tomography, form the relevant subclass of channels 
describing closed-system quantum dynamics. 

In this article we study ancilla-assisted process tomography of general and unital quantum 
channels. The possible system-ancilla output states of relevance are then naturally housed 
in proper convex subsets of the set of all quantum states, and thus permit optimizations 
of the measurement over that necessary to identify a general quantum state. We find that 
the most robust measurements against statistical error, when they exist, are again described 
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by tight POVMs, though a generahzation thereof. In the unital case, these POVMs are 
equivalent to weighted unitary 2-designs [10, 11], but in the general case, they are shown 
not to exist. 

The article is organized as follows. Sections 2 and 3 review quantum process and state 
tomography, respectively, paying particular attention to the pertinent case of ancilla-assisted 
process tomography of quantum channels. Section 4 generalizes results of Ref . [5] , character- 
izing the structure of POVMs that are optimal for linear quantum state tomography when 
a member of a convex subset of all possible quantum states need only be distinguished from 
other members. In Sec. 5 we introduce the concept of a weighted unitary t-design, reviewing 
known results and presenting new ones. Finally, in Sec. 6 we make the connection between 
weighted unitary 2-designs and the POVMs that optimize ancilla-assisted process tomogra- 
phy of unital quantum channels. The article then concludes in Sec. 7 where open problems 
are discussed. In addition, an appendix sets the superoperator notation used throughout this 
article by reviewing a general class of transformations on quantum systems called quantum 
operations. 

2. QUANTUM PROCESS TOMOGRAPHY 

The purpose of this article is to optimize the measurements used for ancilla-assisted pro- 
cess tomography of quantum channels, and in particular, unital quantum channels. Quantum 
channels are nonselective quantum operations. The appendix provides some background to 
this broad class of transformations on quantum systems and introduces important concepts 
and notations relevant to the current study of channels. We will proceed by first describing 
process tomography for unitary operations. This will lead naturally into that for channels. 

The dynamical evolution of a closed quantum system Tig = C'^ is described by a unitary 
operator U G U((i). In the absence of any physical description of the system, we expect 
that the Haar probability measure /x on U((i) most accurately reflects our state of knowledge 
of U . One method to determine U is then to couple Tig to an auxiliary system Ti^ = C^"^ 
(called the ancilla) and allow the combined system Tig (g> Ti.^ to evolve from some initially 
known state, p[ say, to 

p = {U ® I) p,{U^ ® I) . (2.1) 

A measurement on the combined system will then provide information on U . By repeating 
this procedure many times over, perhaps on different input states, U can be determined 
completely. This method of determining quantum dynamics is called quantum process to- 
mography. Although the ancilla could be removed if the initial state were varied, in this 
article we investigate the opposite extreme by choosing dg, = d and then pi = |/)(/|, fixed, 
where for any V &l]{d) we define 

\V) = {V®I)\I) := -^J2^\k)®\k). (2.2) 

k 

The pure state |V^)(V^| is a maximally entangled state of 7Yg(8>?ia, and in fact, all maximally 
entangled states can be written in this form. The output state is p = \U){U\. Note that U 
can be found from \U) (and vice versa) through the relation {j\U\k) = v^((j| ^ {k\)\U), a 
special case of the Jamiolkowski isomorphism below. 

The determination of an unknown unitary U is thus equivalent to the determination of an 
unknown maximally entangled state \U). The latter can be accomplished through quantum 



state tomography. It is unrealistic, however, to presume that each system evolution in the 
above tomographic procedure can be performed identically. The class of quantum states 
under examination should thus be broadened to include any classical mixture of maximally 
entangled states: p = '^i^rk\Uk){Uk\, where each r^ > and '^f.rk = 1. This is the output 
state of a quantum channel. 

The (nonselective) evolution of an open quantum system is described by a quantum chan- 
nel, i.e., a superoperator S G End(End(C'^)) which is both trace preserving and completely 
positive (see the appendix). The channel is said to be unital if it fixes the maximally mixed 
state: S{I) = I- Unital channels include all unitary operations U Q W , and moreover, all 
random- unitary channels, i.e., those which can be implemented by probabilistic applications 
of unitary operators (as above): £ = 'Ylk^kUk Uk^ . In dimensions rf > 3, however, there 
exist unital channels which can not be decomposed in this way [12]. Although random- 
unitary channels are those channels which are most relevant to the study of closed-system 
dynamics, within this context, we will consider the entire class of unital channels together. 

The process tomography of a quantum channel follows that for a unitary operator. The 
output state corresponding to the input pi = |/)(/| completely determines the channel: 

p = {S®I){pO ^ ^ = dJ2 tr[(|m)(j|®|O(A;|)p]|j)(A;|0|/)(m|. (2.3) 

j,k,l,m 

This is the so-called Jamiolkowski isomorphism [13]. It is important that the basis used in 
the right-hand side (RHS) of Eq. (2.3) is that in the definition of [/)(/[. Note that 

trs(p) = lj2^Am^m\j){k\ = lj2^r[\j){k\]\j){k\ = ij, (2.4) 

j,k j,k 

since all quantum channels are trace preserving, and additionally, 

j,k 

for unital channels. This means that the classes of output states of quantum channels, with 
fixed input pi = [/)(/[, do not include all types of quantum states. The same could not be 
said if S were trace decreasing, i.e., a path of a general quantum operation. Denote by 

Q{n) := {A G End{n) \A>0, tr(A) = 1} (2.6) 

the set of all quantum states for H. The two convex subsets of Q(7is ® 'Ha), 

^s^ := {p G QiHs K) I trs(p) = I/d} , and, (2.7) 

^^^ := {p G QiUs K) I tr,(p) = tr,(p) = I/d} , (2.8) 

then correspond to the outputs of, respectively, general and unital quantum channels. 

Let {\k}j^SQ^ be an orthonormal Hermitian operator basis for End(C'*) with the choice 
Ao = I/vd. The remaining operators, \i, . . . , \d2-i, then span the {d^ — l)-dimensional 
subspace of traceless operators: tr(Afc) = v^tr(AoAA;) = for all k > 0. Every quantum 
state for Ti. = C^ (E) C^ is of course expressible in terms of this basis: 

P = X^^jfcAj (g)\k ■ (2.9) 

j,k 



The coefficients must be real, r^-fc G M, but besides positivity of the state, the only other 
constraint is from normalization: Tqq = 1/d. In contrast, the output state p G ^^^ has 
TQk = for all A; > 0, and, in the unital case, p G ^"'^ has r^o = ^ofc = for all A; > 0. 
The number of outcomes of a measuring instrument capable of identifying one such output 
from within its class of output states can thus be reduced from rf^, the number necessary 
to identify a general quantum state, to (P{d^ — 1) + 1 for general channels, or (c?^ — 1)^ + 1 
for unital channels. Ancilla-assisted process tomography of quantum channels is thus not 
equivalent to tomographic reconstructions of general system-ancilla quantum states. 

We will conclude this section by describing how quantum states are naturally embedded 
in Euclidean space. This approach will later provide insight when we harness the concepts 
of frame theory. Embedded in the complex vector space End(7i) is a real vector space of 
Hermitian operators: 

RiH) := {A G End{n) \ A^ = A} . (2.10) 

Equipped with the Hilbert-Schmidt inner product inherited from End(7Y), {A\B) := tr{A^B), 
which induces the Frobenius norm, \\A\\ := a/(A|A), the vector space H(?i) forms a real 
Hilbert space: H(C^) = M^ . Within the context of ancilla-assisted process tomography it 
will be assumed that H = Hs ^H^ = C^ ^ C^ = C^, i.e., D = d?. The above coefficients 
rjjt then define a canonical choice for the isomorphism to MP . Define the two subspaces 

HS^ := {A G H(C'^®C'^) | tri(A) = ti{A)I/d} < H(C^) , and, (2.11) 

H"" := {A G H(C'^ ® C^) I tri(A) = tr2(A) = ti{A)I/d} < W < H(C^) , (2.12) 

which contain the convex sets ^^'^ and ^"'^, respectively, and have dimensions d'^{d'^ — 1) + 1 
and (rf2 - 1)2 + 1. 

In general, Q(?i) is naturally embedded into the vector subspace of H(7i) consisting of 
all traceless Hermitian operators: 

YioiH) := {A G YHH) \ tr(A) = 0} < Yi{H) . (2.13) 

Define 

no := I-;^|/)(/| (2.14) 

which projects onto Ho(7i). This projection defines an isometric embedding of Q(C'^) into 
a (D^ - l)-dimensional real Hilbert space, Q(C^) ^ Ho(C^) ^ M^'~\ 

Ipo) := HoIp) = \p-I/D), (2.15) 

in which the images of pure states lie on a sphere, || \il)) {il)\ — //-D|| = \/{D — l)/D, and the 
images of mixed states within. In the special case D = 2 the embedding is bijective into this 
sphere, realizing the Bloch-sphere representation of a qubit, but is otherwise only injective. 
By 'isometric' we mean that distances are preserved: ||po ~ '^oll = \\p ~ c"||- 

It is important to recognize that both ^^c g^^id ^^'^ are embedded into proper vector 
subspaces of Ho(C'^) under XIq : 

^sc ^ ^sc ._ Yi^j^sc = {Ae H(C'^ ® C^) I tri(A) = 0} , and, (2.16) 

^uc ^ guc ._ j-j^guc = 1^ ^ H(C'^ ® C^) I tri(A) = tr2{A) = 0} . (2.17) 

The dimensions of these subspaces are d'^{d'^ — 1) and (d^ — 1)^, respectively. In Sec. 4 we 
will show that POVMs corresponding to tight frames on these subspaces, if they exist, are 
uniquely optimal for ancilla-assisted process tomography. 



3. QUANTUM STATE TOMOGRAPHY 

This section serves as an introduction to quantum state tomography and is adapted from 
Ref. [5, Sec. 4]. Instead of using the complex vector space End(C^) = C^ as a backdrop, 
however, we will use the embedded real vector space of Hermitian operators, H(C''^) = M.^ . 

The outcome statistics of a quantum measurement on a system 7i = C^ are described by a 
positive- operator-valued measure (POVM) [14]. That is, an operator- valued function defined 
on a o"-algebra over a set ^ of outcomes, F : ^{^) — ^ H(7Y), which satisfies (1) F{S) > 
for all ^ e <8(jr) with equality if (? = 0, (2) F(U^^i 4) = J2T=i^i^k) for any sequence 
of disjoint sets S'^. G ®(J?r), and (3) the normalization constraint F{j^) = /. In this article 
we always take 03 (e^) to be the Borel o"-algebra. When a quantum measurement has a 
countable number of outcomes, the indexed set of POVM elements {F{x)}j:^x completely 
characterizes F, and is thus often referred to as the "POVM." We will call such POVMs 
discrete. 

We will need to express an arbitrary POVM F in a standard form. To do this, note that 
each POVM defines a natural scalar- valued trace measure t{(S) := tr[F(f9')], which inherits 
the normalization t{^) = D. Since each matrix element of F is a complex-valued measure 
which is absolutely continuous with respect to the nonnegative finite measure r, the POVM 
can be expressed as 

F(^) = f At{x)P{x) , (3.1) 



where the positive-operator-valued density (POVD) P : ^ ^ H(7i) is uniquely defined up 
to a set of zero r-measure. The POVD P is of course the Radon-Nikodym derivative of 
F with respect to r. Note that tr(P) = 1, r-almost everywhere. If P also has unit rank, 
r-almost everywhere, then we call F a rank-one POVM. In the special case of a discrete 
POVM, P{x) = F{x)/t{x). 

An informationally complete POVM F [5, 15, 16] is one with the property that each 
quantum state p is uniquely determined by its measurement statistics, p{S) '■= tT[F{S')p]. 
A sequence of measurements on copies of a system in an unknown state, enabling an estimate 
of these statistics, will then reveal the state. This process is called quantum state tomography . 

Definition 3.1. A POVM F : ^{^) -^ H(7i) is called informationally complete with 
respect to ^ (1 Q(7i) if for each pair of distinct quantum states p ^ a E ^ there exists 
an event S' e <B( JT) such that tr [F(^)p] 7^ tr [F{S')(t]. A POVM which is informationally 
complete with respect to Q(7Y) is called an informationally complete POVM (IC-POVM). 

For an arbitrary POVM F, define the Hermitian superoperator JF : H(7i) -^ H(7i) by 

J^ := f dT{x)\P{x)){P{x)\ , (3.2) 

J .9: 

which is positive and bounded under the left-right action. The latter follows from the fact 
that Tr(jF) = J,^.dT{x) {P{x)\P{x)) < D for any POVM, with equality only for rank-one 
POVMs. The image and coimage of a Hermitian superoperator are equal. We call this 
vector subspace of H(7i) the support of JF and denote it by supp(jF). Let span(^) denote 
the subspace of H(7i) spanned by members of JS, and let ri(^) denote the relative interior 
of ^, which is the interior of =S as a subset of its affine hull. Now consider the following. 



Proposition 3.2. Let F : Q3(<^) -^ H(7i) be a POVM. Then F is informationally complete 
w.r.t. JS C Q(7Y) if ^ '^ supp(jF). Moreover, if n{J2) ^ 0, then F is informationally 
complete w.r.t. J2 if and only if ^ '^ supp(jF). 

Proof. Let =2 C supp(jF). Then for the distinct quantum states p 7^ o" G ^ we have 

/ dr(a;)|tr[P(a;)(p-a)]|2 = (p - a\J^\p - a) > 0, (3.3) 

since p — a E supp(jF), being a vector subspace, and p — c 7^ 0. Thus there must exist an 
event <? G Q3(^) with 

f dT{x)tT[P{x){p-a)] ^ 0, (3.4) 

or equivalently, tr[F(^)p] 7^ tr[F{(S')a]. This means F is informationally complete w.r.t. =S. 
Now let F be informationally complete w.r.t. ^, let ri(.^) 7^ 0, and suppose J2 ^ 
supp(jF). There must then exist an operator A G span(.^), A 7^ 0, such that 



iA\T\A) = / dT{x)\tT[P{x)A]\' = 0, (3.5) 

which means tr(PA) = 0, r-almost everywhere. This operator is therefore traceless: 

tr{A) = tr[F{^)A] = f dr{x) ti[P{x)A] = . (3.6) 

Now for any p G ri(^), if e > is chosen small enough, then a = p + eA is also a member 
of 13, and moreover, o ^ p with 



\.x\F{S)g\ = tr[F(^)p] + e f dr(a;) ti[P{x)A] = tr[F(^)p] 

7,r 



(3.7) 



for all (o G Q5(j2r). Thus F could not have been informationally complete w.r.t. .^. We 
must therefore have ^ C supp(jF). D 

This proposition is a straightforward but important observation. If a quantum state need 
only be distinguished from other members of a given convex subset <^ C Q(7i), e.g. ^^'^ or 
^"'^, then since the relative interior of any convex set is nonempty. Proposition 3.2 enables 
us to focus on POVMs for which supp(jF) D ^. Equivalently, it enables us to focus on 
POVMs for which supp(jF) > span(=S). 

Note that any POVM F is informationally complete w.r.t. J2 = supp(jF) n Q(7i). With 
this choice there is a standard procedure for reconstructing a member, p G .^, in terms of 
its measurement outcome statistics, p{(S') = tr [F{(S')p]. Let JF be the unique superoperator 
with supp(jF) = supp(jF) for which 

jrjr = jrjF = n^ ^ (3.8) 

where IIjf denotes the projector onto supp(jF). Of course, JF = JF^^ when .^ = Q(7Y) (as 
in Ref. [5]). Now defining 

\R) := ^\P) , (3.9) 



the left-right action on \p) of the identity 



dT{x)\R{x)){P{x)\ = / dT{x)J^\P{x)){P{x)\ = J^J^ = U:p, (3.10) 

allows state reconstruction in terms of the measurement statistics: 



p = / dT{x) tT[P{x)p]R{x) = / tT[dF{x)p]R{x) = / dp{x)R{x) . (3.11) 

Although the reconstruction operator-valued density R is generally not positive, it inherits 
all other properties of P, i.e. ^ ^dT{x)R{x) = I and tr(_R) = 1 (see Ref. [5]). Finally, it is 
straightforward to confirm that 

:F = f dT{x)\R{x)){R{x)\. (3.12) 

Although we could now proceed directly to an analysis of optimality, we will first briefiy 
show how to embed POVMs into Euclidean space, just as was done for quantum states. 
First note that, for an arbitrary POVM F, the subspace Ho(7Y) < H(7i) is jF-invariant, and 
in fact, 

^ = no^no + -^|/)(/|, (3.13) 

since the identity operator is always a left-right eigenvector with unit eigenvalue: 

J^\I) = I dT{x)\P{x)){P{x)\I) = I dr{x)\P{x)) = I \dF{x)) = \I) . (3.14) 

Although the projector \I){I\/D is fixed by the normalization of F, its complement is free: 

^0 := Ho^no = ^-^|/)(/| = / dr{x)\Po{x)){Po{x)\, (3.15) 

where we define 

|Po) := HolP) = \P-I/D). (3.16) 

It is the superoperator JFq which can be adjusted for optimality. The POVM is thus embed- 
ded into Ho(H): 



|Po(^)) := no|P(^)) = |P(^)-r(^)//D) = / dr(x) |Po(x)) . (3.17) 

This means tr(Po) = and Fq[^) = 0. Note that our "embedding" is anchored to the trace 
measure, however, in that we cannot find F from Fq without knowledge of r. 

4. OPTIMAL LINEAR QUANTUM STATE TOMOGRAPHY 

In this section we decide which POVMs are the most robust against statistical error for 
linear tomographic reconstructions of quantum states. Our premise is that a member of 
some subset of all possible quantum states needs to be distinguished from other members. 
Although we will focus on the two convex subsets, ^^'^ and ^^^, being those sets relevant to 



the process tomography of general and unital quantum channels, the following analysis will 
apply to other convex subsets ^ C Q(7i) with similar properties. The choice ^ = Q(7Y) 
was considered in Ref. [5] and the following can be considered a generalization. 

Let F be a POVM which is informationally complete w.r.t. J2 C Q(7i). Throughout this 
section we assume a linear state-reconstruction formula valid for all p G .^ of the form 

\p) = [ dp{x)\Q{x)) = [ {dF{x)\p)\Q{x)), (4.1) 

where Q : ^ -^ span(^) is called a reconstruction OVD. By linearity, this formula implicitly 
presupposes that supp(jF) > span(^). 

It is instructive to start with the special case of a discrete POVM, {F{x)}x£.:r- Suppose 
that yi, . . . ,yN are the outcomes of measurements on A^ identical copies of the state p E ^. 
One estimate for the outcome probabilities is then 

1 ^ 
p{x) = p{x;yi,...,yN) := —^5{x,yk), (4.2) 

k=l 

which gives 

P = p{yu---,yN) ■■= ^p{x;yi,...,yN)Q{x) , (4.3) 

for an estimate of p. We will call p a linear tomographic estimate of p to distinguish it from 
more sophisticated choices, such as those from maximum likelihood estimation [17, 18] or 
Bayesian mean estimation [19-23]. 

The mean squared Hilbert-Schmidt distance provides a measure of the expected error in 
our estimate: 



yi,---,yN 



e--\p) := E \\\p-p{y,,...,yMW (4-4) 



= 5^ E [{p{x)-p{x)){p{y)-p{y))]{Q{x)\Q{y)) (4.5) 

=■■ ^(Ap(g)-tr(p2)), (4.7) 
using Eq. (4.1) and given that 

E [{pix) - p{x)) {p{y) - p{y))] =—(pix)5{x,y)-p{x)p{y)j, (4.8) 



yi,--;yN 



which is an elementary calculation. Equation (4.7) is also a fitting description of the error 
for a POVM with a continuum of measurement outcomes if we define 

Ap(g) := / dpix) iQix)\Qix)) (4.9) 

in general. This is because a countable partition of the outcome set ^ allows any POVM 
to be approximated by a discrete POVM. Our estimate p remains a good approximation for 
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the probability measure p, except now with x and yi, ... ,y]\f in Eq. (4.2) indicating members 
of the partition. In the hmit of finer approximating partitions we again arrive at Eq. (4.7) 
for the average error, but now with Eq. (4.9) for Ap{Q). Since we have no control over the 
purity of p, it is this quantity which is now of interest. 

The POVM which minimizes Ap{Q), and hence the error, will depend on the quantum 
state under examination. When JS = Q(C^) it is natural to remove this dependence by av- 
eraging over all Hilbert-space orientations between the system and measurement apparatus. 
That is, we set p = p{a, U) := UaW where a G Q(C^) is fixed, and average Ap{Q) over ran- 
dom choices oi U e IJ{D). When ^ = ^s^ ^""^ C Q(7^, o H^) the natural procedure is to 
average over all local Hilbert-space orientations f/g G l]{d) between the system and measure- 
ment apparatus, and all local Hilbert-space orientations [4 G l]{d) between the system and 
ancilla. The end result is the same, however. Setting p = p{a, Ug^Ua) = (f/s®f^a)cr(f4®f^a)^ 
we take the average over all Us,Ui, G \J{d) : 



// dp{U,)dp{U,) ApiQ) 

J JV(d) 



d/i(f/s)d^(f/a) / ii[dF{x){U,®U^)a{U,®U^)%Q{x)\Q{x)) (4.10) 

U(d) J X 

= 1 / ii[dF{x)]ii{a){Q{x)\Q{x)) (4.11) 

= ^1 dT{x) {Q{x)\Q{x)) (4.12) 

=:^^r{Q), (4.13) 

where /x is the unit Haar measure, using Shur's lemma for the integrals. 

It would be presumptuous to take At-{Q) as an error estimate for an arbitrary subset 
■2 C Q(C^) without further information on its structure. Nevertheless, assume that there 
is a natural set of possible "orientations" ^ C U(D) between Ti = C^ and the measuring 
apparatus, and a probability measure u on ff, with the property that for any a E J2, 

du{U)UaU^ = —I . (4.14) 

Then J^du{U) Ap{Q) = A^{Q)/D as above. We thus take ^ = \J{d) U((i) and u = p x p, 
when ^ = J2^'^ or JS^'^. Another example is 

^'^^ '■= {p G Q(C'^) I p is diagonal in the standard basis} . (4-15) 

The members of ^'^' might be described as "classical" states, being convex combinations of 
basis states: p = X]fe^fcl^)(^l- Under random permutations U of basis elements, Eq. (4.14) 
is satisfied for any a G ^'^^ In general, we suspect that the above averaging of the error 
makes sense whenever .:^ is a convex subset of Q(C'^), containing the completely mixed state 
I/D, and possessing a symmetry about this state described by Eq. (4.14). 

We now proceed to an analysis of optimality. The above considerations are summarized 
as a definition: 

ei^'^\a) := j ^du{U) e^^>Q\p{a,U)) = -^[a^{Q) - D ir{a')) . (4.16) 
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Our goal now is to find the optimal pairs {F, Q) which minimize eL for a given fixed ^. 
There are generally many different choices for the reconstruction OVD Q : ^ ^ span(.^) 
that satisfy Eq. (4.1). Our next task is to show that the canonical choice we encountered 
in Sec. 3 is uniquely optimal. We thus minimize ^t{Q) over all Q while keeping F fixed. 
Our only constraint is that our state-reconstruction formula [Eq. (4.1)] remains valid for 
all p G =S. By linearity, this formula is then also valid for any member of span(=S), and 
therefore, 

/ \Q{x)){dF{x)\U^ = [ dT{x)\Q{x)){P'{x)\ = n^, (4.17) 

where 11^ projects onto span(^) and \P') := II^\P). This means {Q{x)}x£S' is a dual frame 
to the frame {P'{x)}x£S', w.r.t. r, within the subspace span(^). Consult Christensen [24] 
for an introduction to frame theory (see also Ref. [5]). The following lemma shows that 
there is a unique canonical dual frame which is optimal (see Ref. [5, Lemma 16] for a proof). 

Lemma 4.1. Let {A{x)}x£.f be an operator frame w.r.t. the measure a. Then for all dual 
frames {5(a;)}^g,r, 

da{x){B{x)\B{x)) > / da{x) {A{x)\A{x)) , (4.18) 

with equality only if B = A, a-almost everywhere, where {A(a;)}^g,^- is the canonical dual 
frame, i.e., \A) := A'^\A) with A := J,^ da{x)\A{x)){A{x)\. 

Let J-'' be the inverse of JF' := II^jFn^ in the subspace span(=S) and define \R') := JF'|P'). 
This means 

^' = f dT{x)\R'{x)){R'{x)\. (4.19) 

Lemma 4.1 shows that 

AriQ) > Ar{R') = Tr(r) , (4.20) 

with equality only if Q = R', r-almost everywhere. We thus make this choice and now 
minimize the quantity Tr(jF') over all POVMs. Define 6' := dimspan(,^). 

Lemma 4.2. Let F : !B(J?r) -^ H(C'^) be a POVM which is informationally complete w.r.t. 
^ C Q(C-°), where ri(^) ^ ^ and I e span(^). Then 



Tr(.FO > ^^^-^ + 1 , (4.21) 



with equality if and only if 



^=f^n, + J:^|/)(/|. (4.22) 

Proof. Since F is informationally complete w.r.t. ^ and ri(.^) ^ 0, by Proposition 3.2, we 
must have supp(jF) > span(cS). Thus JF' = H^JFH^ has 5' nonzero left-right eigenvalues: 
Ai, . . . , X^i > 0. One eigenvalue is fixed at unity, however, since J-''\I) = n^jFn^|/) = [/) 
when [/) G span(=S), given that [/) is always an eigenvector of JF [Eq. (3.14)]. We thus take 
Ai = 1. The remaining eigenvalues satisfy 

S' 

^ Afc = Tr(J^') - 1 < Tr(J^) - 1 < D-1 , (4.23) 

fc=2 
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given that Tr(jF') = Tr(n^jF) < Tr(jF), with equahty if and only if supp(^) = span(^), 
and Tr(jF) < D, with equahty if and only if F is a rank-one POVM. Under this constraint, 
it is straightforward to show that 

^' 1 
Tr(-F') = $^- + 1 (4.24) 

takes its minimum value if and only if A2 = ■ ■ ■ = A^' = (-D — l)/(5' — 1), or equivalently, 

^' = f3T(n^-:^l^)(^l) + ;^l^)(^l- (4-25) 

Note that Tr(jF') = D with this choice, however, which requires supp(jF) = span(^). We 
must therefore have T' = T and Eq. (4.25) is equivalent to Eq. (4.22). Finally, the minimum 
value of Eq. (4.24) is Tr(^') = {8' - 1) ■ ([b' - 1)/{D - 1)) + 1 = {5' - lf/{D - 1) + 1. D 

It is important to recognize that our condition for optimality [Eq. (4.22)] sets supp(jF) = 
span(^). We now proceed under this assumption, effectively replacing each primed symbol 
above by its unprimed counterpart. Furthermore, optimality requires a rank-one POVM. 
This is because Eq. (4.22) gives Tr(jF) = D, which is possible only for rank-one POVMs. 
When ^ = Q{C^) we recover the tight rank-one IC-POVMs described in Ref. [5]. Let us 
use the same terminology here. 

Definition 4.3. Let F : <B( JT) -^ H(C^) be a POVM. Then F is called Ught if the OVD 
{Po(2;)}xe.r forms a tight operator frame w.r.t. r in supp(jFo), i.e., 

J^o ■■= I dT{x)\P^{x)){P^{x)\ = aU^Fo, (4.26) 

for some constant a > 0, or equivalently, 

T = aU^ + ^\I){I\. (4.27) 

The constant satisfies a < {D — l)/{6 — 1), where 6 := rank(jF) (left-right rank), with 
equality only for rank-one POVMs. Returning to Eq. (4.22) we see that tight rank-one 
POVMs are precisely those which are optimal for linear quantum state tomography. That 
is, 

^=^n, + ^|/)(/| (4.28) 

if and only if F is a tight rank-one POVM. The optimal state-reconstruction formula is given 
by Eq. (3.11), which now takes the form 

since a straightforward calculation of JF from Eq. (4.28) shows that 

R= ^^P - 4^1- (4-30) 

D-1 D{D-l) ^ ' 

We now restate our findings in a theorem. 
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Theorem 4.4. Let F : ^(^) -^ H(C'^) be a POVM which is informationally complete 
w.r.t. =S C Q(C^), assumed convex and containing I/D, and let 5' = dim span (,^). Then 
for any fixed quantum state a G =S, 

ei^'^H-) > m{^^n^ + 1-Dtria-)^ , (4.31) 

for all reconstruction OVDs Q. Furthermore, equality occurs if and only if Q = R and F 
is a tight rank-one POVM with supp(jF) = span(^). 

Now consider the worst-case error. The average provides a lower bound: 

ei^'«)(a) := supe(^'«)(p(a,f/)) (4.32) 

> 4f''^V) (4.33) 

Returning to Eq. (4.7), however, but now with Q = R and p = p{a, U) = UaU\ we find 

e<««>(p('^. t')) = ^ ( /,, *(a^) (R(-J:)\Ri.x)) - t.i(a')^ (4.35) 

when R satisfies Eq. (4.30) and P is rank one, regardless of orientation f/ G i^ C \][D). Thus 
given 5 = 5' [supp(jF) = span(,^)] when eL is minimized, the following is a consequence. 

Corollary 4.5. Let F : Q3(^) — *> H(C'^) he a POVM which is informationally complete 
w.r.t. ^ C Q(C^), assumed convex and containing I/D, and let 5' = dimspan(,:^). Then 
for any fixed quantum state a E JS, 

e^Jf\-) > ^(^^^ + 1 -Dtr{a^)^ , (4.38) 

for all reconstruction OVDs Q. Furthermore, equality occurs if and only if Q = R and F 
is a tight rank-one POVM with supp(jF) = span(^). 

Tight rank-one POVMs are thus optimal for linear quantum state tomography in both 
an average and worst-case sense. In fact, they form the unique class of POVMs that achieve 

el^'^^(-) = ei^'«)(a) = e^''^\p{a,U)) = ]^ (^^^ + 1 " ^tr(.^)) . (4.39) 

The exact structure of these POVMs for ^ = ^^^ and ^ = .^^^, when they exist, will be 
explored in detail in Sec. 6. To do so, however, we first need to explain the concept of a 
"unitary t-design." This is done in the following section. When ^ = Q(C'^) we recover the 
results of Ref. [5, Theorem 18 and Corollary 19]. Lastly, consider ^ = ^'^^ . Only 5' = D 
dimensions are then spanned, giving (l — tr(o"^))/A^ for the minimum error. In particular, 
for pure states this is zero. 
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5. UNITARY t-DESIGNS 

The extension of spherical t-designs [7] to the unitary group was recently considered by 
Dankert et al. [10] and Gross et al. [11], and the following definition is equivalent to theirs. 
By a "unitary t-design," however, we really mean a projective unitary t-design, in that each 
e^'^U G U((i) should always be identified with U . With this in mind, let U{x) G U(d) denote 
a representative from the equivalence class of unitaries x G PU((i) = U((i)/U(l) and let ix 
denote the Haar measure on PU((i) with the normalization /i(PU((i)) = 1. A countable set 
S^ endowed with a weight function w : ^ ^ (0, 1], where '^xt^ywi^x) = 1, will be called a 
weighted set and denoted by the pair {y, w). 

Definition 5.1. A finite weighted set {^,w), '3) C PU((i), is called a weighted t-design (in 
dimension d) if 

y^w{x)U{x)®'®{U{xY^')^ = I d/i(a;)f/(a;)^*® (f/(a;)®*)^ (5.1) 

When w{x) = 1/|^| we recover the more common notion of an "unweighted" t-design. 
Define T := i]^-fc|j)(A;| ® |A;)(j|, which satisfies tr[{A ® B)T] = tr{AB) and is called the 
swap (or transposition) since T\ip) |0) = |0) \ip). By multiplying Eq. (5.1) on the right 
by /®(*~i) (g) T ® /®(*-i) and tracing out the inner pair of subsystems, we can immediately 
deduce that every weighted t-design is also a weighted (t — l)-design. Repeating this process 
t times shows that the normalization of w is in fact already implied by Eq. (5.1). Unweighted 
t-designs in PU{d) exist for every t and d : 

Theorem 5.2 (Seymour and Zaslavsky [25]). Let Q be a path- connected topological space 
endowed with a measure uo that is finite and positive with full support, and, let f : Q ^ M™ 
be a continuous, integrable function. Then there exists a finite set ^ C f2 such that 



^E/(^) = ^/d^(^)^(^)- (5-2) 



The size of ^ may be any number, with a finite number of exceptions. 

Corollary 5.3. For each pair of positive integers t and d, and for all sufficiently large n, 
there exist (unweighted) unitary t-designs in dimension d of size n. 

Proof. Simply let Q = Pl]{d), uj = ^, and apply Theorem 5.2 to 

fix) := ?7(x)^*® (f/(x)^*)\ (5.3) 

which maps PU(rf) into End(C'=')®2* ^ M2d«_ □ 

The task of finding t-designs is facilitated by the following theorem. Define the positive 
constant 

7(t,rf) := f d/i(a;)|tr[f/(x)]p* . (5.4) 

ipu(d) 
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Theorem 5.4. For any finite weighted set {^,w), 5^ C PU((i), and any t > 1, 

J2 w{x)w{y)\tT[U{xyU{y)]\'' > j{t,d) , (5.5) 

with equality if and only if {S^,w) is a weighted t-design. 

Proof. Defining S := T.xe^'^i^) U{x)^' ® (f/(a;)®*)t - J^^^^^ d/i(x) f/(x)®* ® (f/(a;)®*)t we 
see that 



< tT{S^S) = V w (x) w{y)\ tip {xyU{y)]\'^'- 2 y^wix) f d/i(i/) | tr[[/(a;)"ff/(i/)]p 



|2t 
L/x^y; I hL[uyj.)u\;y)\ 

x,yey x^y ^PU(d) 



+ / dn{x) d/i(i/)|tr[f/(a;)tf/(y)]|2* (5.6) 

^ ti;(a;)w(i/) | tr[f/(a;)tf/(y)]|2* _ /" d^(a;) | tr[f/(a;)]|2* (5.7) 



with equahty if and only if S* = 0, which is the defining property of a t-design. D 

This theorem allows us to check whether a weighted subset of PU(d) forms a t-design 
by considering only the "angles" between the supposed design elements. It also shows that 
t-designs can be found numerically by parametrizing a weighted set and minimizing the LHS 
of Eq. (5.5). The lower bound can be considered a variation on the Welch bound [26]. The 
constant 7 was calculated by Diaconis and Shahshahani [27] for rf > t, in which case 7(t, d) = 
t!, and by Rains [28] in general. It is the number of permutations a E St (the symmetric 
group [29]) such that (o"(l), o"(2), . . . , a(t)) has no increasing subsequence of length greater 
than d. Thus, for example, 7(1, d) = 1 and 7(2, d) = 2 for all d > 2. 

A unitary 1-design must satisfy 

^w{x)U{x)0U{xy = f dfi{x)U{x)0U{xy = ^T , (5.8) 



where the RHS of Eq. (5.1) is now explicitly evaluated (simply consider a matrix component 
of the integral in the standard product basis and use Schur's lemma). Since T has eigenvalues 
of 1 and —1, respectively, on the symmetric and antisymmetric subspaces of C^ (8> C'^, and 
thus rank(T) = d"^, we must have |^| > d'^ with equality only if ^ is an (orthogonal) unitary 
operator basis, i.e. tT[U{xyU{y)] = for all x 7^ y G &, and w{x) = l/\^\- This fact is 
more apparent when Eq. (5.8) is rewritten in terms of superoperators: 

J2y^ix)\Uix))iUix)\ = il. (5.9) 

xeSi 

In this form it is clear that unitary 1-designs are equivalent to tight unitary frames [5, 30]. 
The unitary operators with matrix elements [30] 



{j\Um\k) := ^exp 



2TTijk 2TTi{j + kd)m 
d n 



(5.10) 
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ioT m = 0, . . . ,n — 1, provide explicit examples of (unweighted) unitary 1-designs for all 
n= \S!\ >(f. 

To treat the general case, for an arbitrary permutation a G S„, define the permutation 
operator 

P{a) = Pa(l)a{2)...a(n) '■= Yl l^l) Oa(l) I ® l^s) 0a(2) I ® " " " ® I jn) Oa{n) I , (5.11) 

jl,J2,---,jn 

which acts on (C^)®" by permuting its subsystems accordingly: 

Pfcifc2...fcJ^fci) ® l^fea) ® ■ ■ ■ ® l^fcj = l^l) ® 1^2) ® ■ ■ ■ ® |^„) , (5.12) 

and in terms of operators, 

^fcifc2...fc„ (^fci ® ^fe ® ■ • ■ ® ^fcj Pk^k2...kJ = Ai ® A2 ® ■ • ■ ® A„ . (5.13) 

The composition of permutation operators then follows that for permutations, P{o-)P{t) = 
P{aT), which means P((t)'^ = P(o--i). Note that P21 = T. 

In general, the RHS of Eq. (5.1) can be integrated explicitly using group theoretical 
methods [31, 32]. The result is 

/ dfi{x)u{xr'^{u{xr'y = v wg(d,t,ar-i)p,(i)+,,...,,(,)+,,,-i(i),...,,-i(,), (5.14) 

where 

;(A)<d 

is called the Weingarten function. Here the sum is over all partitions A = (Ai,...,A() 
of the integer t (i.e. nonincreasing sequences of nonnegative integers summing to t) with 
length /(A) < rf, where /(A) := maxA >o j- The character on the conjugacy class Ka of Sj 
corresponding to A h t is denoted by x^ ^'^'^ W6 take SA,d(l) = sx^^il^ ■ ■ ■ A) for the Schur 
function SA,d(a;i, . . . , Xd) [29]. For any partition A h t one has 

^J\-t 

and in particular, S(i 1)^(1) = d{d + l)/2 and S(2o)d(l) = c?(c? — l)/2. This means 
Wg(rf, 2, (1, 1)) = l/(rf2 - 1) and Wg(rf, 2, (2, 0)) = ~l/d{<f - 1), giving 

d/i(a;) f/(a;) O U{x) ® U{x)^ ® U{x)^ 

PU(d) 

1 1 

= -„ (P3412 + P4321) — 'TTn 7\ (-P4312 + P3421) . (5.17) 

Thus our definition of a unitary 2-design [Eq. (5.1)] can be rewritten as 
Yw{x) U{x) (g) U{x) (g) U{xy (g) U{xy 

P3412 + P4321) — TTJ? TT (-P4312 + P3421) . (5.18) 



d^-l' '^'^ ' ^■'''' d{d^ - 1) 
The following is partly due to Gross et al. [11, Theorem 2] 
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Theorem 5.5. Let {!^,w), 3l C PU((i), he a weighted 2-design. Then 

m > {d^-lf + l, (5.19) 

with equality only if w{x) = l/\^\ and 

\tT[U{xyU{y)]\' = 1-^, (5.20) 

for all X ^ y E & . 

Proof. Multiplying Eq. (5.18) on the left by P2341 and on the right by P234i^ = -P4123 implies 

^ w{x) U{x)^ ® U{x) (g) U{x) (g) U{xy 

1 1 

= ,2 _ . (-P3412 + -P2143) — ,/ ,2 _ 1 N (-P3142 + -P2413) 5 (5.21) 

given Eq. (5.13) and since, for example, P2341-P3412-P4123 = -P2341-P2341 = -P3412. Now multiply 
this equation on the right by A (g) / (g /, where A G End(C'^) (g End(C'^), and trace out the 
first pair of subsystems. The result can be written in terms of a superoperator: 

S{A) := ^^(a;) tr [(f/(x)f ® U{x))A] U{x) ® U{xy (5.22) 

_ A + tT{AT)T {l(g)tTMT))T+{ti2iAT)(g)l)T 

~ d'-l d{d^-l) ' ^^■^'^^ 

by rewriting the permutation operators explicitly in terms of their definition [Eq. (5.11)] 
and simplifying. 

Now let {Ek}j^^Q^ be an orthonormal operator basis for End(C'^) with the choice Eq = 
I/vd. The remaining operators Ei, . . . , E^^^i then span the (rf^ — l)-dimensional subspace 
of traceless operators: tT^E^) = \/dtT{ElEk) = for all k > 0. Consider the action of S on 
{Ej ® Ek)T : 

S{{Ej ® Ek)T) = -^—j {{Ej ® Ek)T + dHjQ5ko{EQ ® Eq)T 

- (5,o(^o ® Ek)T - 5ok{Ej ® Eo)t) (5.24) 



{Eo ® Eo)T , J = k = 0; 

{E,®Ek)T/{d^-l), j,k>0- (5.25) 

, otherwise , 

identifying / = yrf-Eo- Thus the d'^ orthonormal operators {Ej (g Ek)T diagonalize S and, 
in particular, rank' (5) = (rf^ — 1)^ + 1 (ordinary rank). But we must have \^\ > rank' (5), 
which is Eq. (5.19). 

If 1^1 = rank'(iS) then {U{x)(^U{xy}x£S> is necessarily a linearly independent set. Fixing 
y E Si and considering S{U{y) (g [/{yY) shows that 

{d^ -l)J2 ^(^) I tr[f/(a;)"ff/(i/)] |2 U{x) ® f/(a;)f = U{y) ® Uiy^ +(d-'^)T , (5.26) 
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which, upon setting T = d'^^^^w{x)U{x) ® f/(a;)^ [Eq. (5.8)], can be rewritten as 

{{(d'-ir+l)w{y)-l}U{y)^U{yy 

+ J2w{x){{d^-l)\tT[U{xyU{y)]\^-d^ + 2}U{x)(^U{xy = 0. (5.27) 

When 1^1 = (rf^ — 1)^ + 1 hnear independence thus requires (rf^ — 1)| tr[f/(a;)^[/(|/)]p = rf^ — 2 
for all a; 7^ y and ((rf^ — 1)^ + l)w(|/) = \^\w{y) = 1. The same is true for all y E ^. D 

In general, for each positive integer t and d, we would like to know the quantity N{t, d), 
which we use to denote the minimum number of unitaries needed to construct a weighted 
t-design in PU{d), or less ambitiously, bounds on this quantity. This is a difficult problem. A 
general lower bound, however, might be obtainable from the theory of Levenshtein [33, 34]. 

Our own numerical searches have not revealed the existence of 2-designs achieving \^\ = 
(d^ — 1)^ + 1 and Gross et al. [11] have conjectured their nonexistence. If such designs did 
exist then they might be dubbed tight designs, which is standard terminology in the theory 
of t-designs [7] (but unrelated to the concept of tight frames). As noted in Theorem 5.5, tight 
PU((i) 2-designs are necessarily equiangular. They are analogous to tight CP'^^^ 2-designs, 
i.e. symmetric informationally complete POVMs (SIC-POVMs) [35], which in contrast are 
conjectured to exist in all dimensions. The analogy to a complete family of mutually unbiased 
bases (MUBs) [36, 37], however, does exist in certain dimensions. 

A subset {f/j} -^Q^ C U((i) is a unitary operator basis for End(C'^) if ii{Uj^Uk) = d5jk for 
all < j,k < d'^ — 1. In analogy with the case of vector bases, we call a pair of unitary 
operator bases, {Uj},^^ and {Vk}j^Sj, mutually unbiased if 

|tr(f//\4)|2 = 1 (5.28) 

for all 0<j,k<d^-l. Define the embedding ^ : U(rf) ^ H^^ = M(^'-i)' by 

|^(f/)) := Uo\\U){U\) = \\U){U\-I/d'), (5.29) 

where \U){U\, Hq and Hg'^ are defined in Eqs. (2.2), (2.14) and (2.17), respectively. The set 
{'d{Uj)}jjQ^ then specifies the vertices of a regular simplex in the (rf^ — l)-dimensional sub- 
space of Hq'^ for which its members span. Mutually unbiased bases correspond to orthogonal 
subspaces, 

im^rnvk)) = m\v,)\'-^^ = ^iMf/.v.)^-^ = o, (5.30) 

of which, there can be at most {dim.B.Q'^) / {d"^ — 1) = rf^ — 1 many. A set of c?^ — 1 unitary 
operator bases with the property that each pair is mutually unbiased is thus called a complete 
set of mutually unbiased unitary-operator bases (MUUBs). 

Now consider an arbitrary family of subsets, Mq, . . . , ^m-i C Pl]{d), where each member 
^a = {e|}^^^^ specifies a unitary operator basis {U{e'^)}'^^^ and is appointed a positive 
weight Wa- By Theorem 5.4, if 

m— 1 cP — 1 

Y, w^w, Y, |tr[f/(e|)tf/(et)]r = 2 , (5.31) 

a,b=0 j,k=0 
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then their union 2! = Ua^a forms a weighted 2-design with weight w{x) = ^^ Wal.<g^(a;). In 
the context of quantum process tomography it is desirable for the weight to remain constant 
across elements of the same basis (so the POVM that the design specifies can be implemented 
by a series of orthogonal measurements). We have thus made this a requirement. The set 
indicator function, ly{x) := 1 if x G ^ and otherwise, is used to take care of any multi- 
plicity across different bases. Notice that the normalization of w{x) implies normalization 
of the basis weights: J2a'^a = l/(f- 

It is straightforward to confirm [via Eq. (5.31)] that a complete set of MUUBs forms a 
unitary 2-design when Wa = l/m(P. The following theorem shows that such sets are optimal, 
in that we always need m > c?^ — 1 unitary operator bases to construct a weighted 2-design, 
with equality only if the bases are mutually unbiased. 

Theorem 5.6. Let d> 1 and let SSq^ . . . , SS^-x C PU((i) specify a family of unitary operator 
bases for End(C'^), where the union 2 = Ua-^a forms a weighted 2-design with weight 
function w{x) = '^^'^a^-'^ai^) f^f^ some choice of basis weights Wq, . . . ,Wm-i > 0. Then 
m> d^ — 1 with equality only if Wa = l/md"^ for all a and the bases are pairwise mutually 
unbiased. 

Proof. Theorem 5.5 with \2\ = md"^ immediately shows that we require m > d'^ — 2 + 2/d'^, 
which means m > d"^ — 1 whenever li > 1. In the case of equality, note that by Theorem 5.4 
[or Eq. (5.31)] we require 

a a^b j,k 

where we have defined the positive numbers A"^ := | tr[f/(e")''"?7(e^)]p. Moreover, Theo- 
rem 5.4 implies that the LHS of Eq. (5.32) is minimal with respect to the variables Wa and 
A°^ under the appropriate constraints, two of which are ^^ Wa = 1/d'^ and 

E^S = Y.\^U{e^)\U{e\W = rf^Tr(I.I) = d\ (5.33) 

j,k j,k 

since all unitary operator bases satisfy ^- \U{ej)){U{ej)\ = dl. We will now minimize the 
LHS of Eq. (5.32) under these two constraints. The minimum of "^j ^i-^'jk)'^ subject to 
Eq. (5.33) occurs only when A"^ = 1 for all < j,k < d'^ — 1, i.e., when ^a and ^j, are 
mutually unbiased. Then the LHS of Eq. (5.32) reduces to 

^' E ^' + ^^ E ^"^'' = ^^(^' - 1) E ^' + ^ ' (^-3^) 

a a^b a 

and here the minimum (under '^^Wa = 1/c?^) occurs only when Wa = 1/md'^ for all < a < 
m—1. With this value, Eq. (5.34) reduces to the RHS of Eq. (5.32) when m = d'^ — 1. Equality 
in Eq. (5.32) thus requires the bases to be pairwise mutually unbiased and Wa = 1/md 
whenever m = d"^ — 1. D 

Theorem 5.6 is the equivalent of Ref. [38, Theorem 3.2] for the case of unitary designs. 
Many examples of unweighted unitary 2-designs were described by Gross et al. [11]. Of these, 
the Clifford designs were found closest to optimal. These are sets of unitary operator bases 
which form subgroups of the projective Clifford group PC{d) [39, 40] and have cardinahties 
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\3l\ = k(P{(P' — 1) for some integer k. When A; = 1, Clifford designs are known to exist in 
dimensions (i = 2, 3, 5, 7, 11 [41]. By Theorem 5.6, each of these examples must be the union 
of a complete set of MUUBs. Although no unweighted unitary 2-designs of smaller size were 
found by Gross et al. [11], weighted unitary 2-designs can surpass this record. This is the 
case for PU(2) 2-designs, which are described in detail next. 

5.1. PU(2) t-designs 

In dimension 2, unitary designs are equivalent to real projective designs, which in turn 
are equivalent to antipodal spherical designs. To see this, simply note that PU(2) = MP^ 
through the relation 

e^-^f/ = roJ + ilriX + rsF + rgZ) , (5.35) 

where X := (lo), Y := (°^*), and Z := ( o -i ) S'-re the Pauli matrices. Each unit vector 
(ro,ri,r2,r3) G M^ specifies a line in MP^, and through Eq. (5.35), an equivalence class of 
unitaries U G U(2) differing only be a phase factor. Under this map each t-design in PU(2) 
gives a t-design in M.P'^ and vice versa. This is because distances are preserved: | ii{U'^V)\'^ = 
4(r|s)^ where (r|s) := "^j^rkSk and r (respectively, s) corresponds to U (respectively, V) 
through Eq. (5.35). Theorem 5.4 then transforms to the equivalent for real projective designs. 
This relationship also gives 

Real projective designs are rarely studied in the literature. It is well known, however, that 
t-designs in MP""^ are equivalent to antipodal (2t + l)-designs in S*""^ with twice as many 
points (assuming antipodal pairs are appointed the same weight). The antipodal points 
of the spherical design are simply the intersections between the lines of the real projective 
design and the unit sphere. Additionally, an antipodal spherical (2t -|- l)-design can be 
created from (2t)-design by simply appending the antipodal points to the design: if ^ is a 
(2t)-design in S^"^ then ^ U (— ^) is an antipodal (2t + l)-design in S"-^^. 

The above relationships can be used to translate known results in the literature to the 
case of unitary designs. For example, the lower bound of Delsarte et al. [7] on the number 
of points needed to construct a (2t -|- l)-design in S^ shows that 

m > ^(t + l)(t + 2)(t + 3) (5.37) 

o 

for a t-design in PU(2), with equality only if the design is unweighted [34], i.e. w{x) = \j\^\. 
A design which achieves this bound is generally called tight. It is known, however, that 
tight S'^ (2t + l)-designs exist only for the trivial t = 1 case [7, 42, 43] (see Ref. [44] for a 
summary). Thus we can increase the RHS of Eq. (5.37) by 1 when t > 1. Further bounds 
on the cardinality of a PU(2) t-design are summarized in Table I. 

The first row of Table I corresponds to t = 2, which we now explain in detail. The Delsarte 
bound [Eq. (5.37)] shows that PU(2) 2-designs must have at least 10 points. However we 
can increase this bound to 11 since there are no tight PU(2) 2-designs. The 11 columns of 
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t 


Delsarte 


Nit, 2) > 


iV(t,2)< 


2 


10 


11-^ (12'') 


11" (12^^) 


3 


20 


21'' 


23^= (24/) 


4 


35 


37^ 


43" 


5 


56 


605 


60^^ 


6 


84 


85-^ (89*) 




7 


120 


1349 


264-? 


8 


165 


166'' (180*) 




9 


220 


2509 


360^^ 


10 


286 


287^^ (318*) 





"No tight 5*3 (2t + l)-designs exist for i > 1 [7, 42, 43]. 
'No antipodal unweighted 22-point 5*^ 5-designs exist [45]. 
"^A weighted 11-point PU(2) 2-design exists [Eq (5.38)]. 

''The 24 vertices of the 24-ceU form an antipodal unweighted S^ 5-design [46] , and thus also an unweighted 
PU(2) 2-dcsign. This design is a minimal subgroup of PC(2). 
•^A weighted 23-point S^ 6-design and a weighted 43-point S^ 8-design exist [47] . 
^The projective Clifford group PC(2) is an unweighted PU(2) 3-design. The corresponding S^ 7-design is 

formed by the vertices of 2 copies of the 24-cell [46] . 

^The linear programming bounds for weighted 5*^ {2t + l)-designs of Ref. [48]. 

''The 120 vertices of the 600-cell form an antipodal unweighted S^ 11-design. This is the unique minimal 
unweighted S^ 11-design [49, 50]. 

*Yudin's bound [51] on unweighted spherical designs gives \^\ > 77/(7: — 2x\J\ — x^ — 2arcsinx) for un- 
weighted PU(2) t-designs, where x is the largest zero of the Jacobi polynomial P^t^x (x)- 

^An antipodal weighted 528-point S^ 1 5-design can be constructed from shells of a Euclidean lattice [48]. 

'^Thc union of the 120 vertices and the 600 face centers of the 600-cell form a weighted antipodal S^ 
19-design [52]. The vertices have weight 1/504 and the faces have weight 2/1575. 

TABLE I: Known bounds on N{t,2), the minimum cardinality of a weighted t-design in PU(2). 
The Delsarte lower bound [Eq. (5.37)] is included as a reference point, and for completeness, bounds 
on the minimum cardinality for an unweighted design are included in parentheses. 
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(5.38) 



where a = l/v3 and b = a/2/3, specify a weighted PU(2) 2-design [through Eq. (5.35)] which 
achieves the bound. The weight appointed to the first column is 1/16 while the remaining 
all have weight 3/32. Reznick [45] has shown that there are no antipodal unweighted S^ 
5-designs with 22 points. Thus this design is necessarily weighted. The 24 vertices of the 
24-cell form an antipodal unweighted S^ 5-design [46], and thus also an unweighted PU(2) 
2-design with 12 points. The elements of this design correspond to the subgroup {HR, R^) 
of the Clifford group C(2) = {H,R), where H = -j=(^\ _\^ and R = ( J ?)• It is formed by 
the union of a complete set of MUUBs and is the first in the family of Clifford designs. 
Finally, Shamsiev's explicit constructions of antipodal weighted spherical designs [53, 
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Theorem 1] show that weighted PU(2) t-designs with (t + 1)^/2 points exist for all odd t. 
This upper bound on the cardinality together with the Delsarte lower bound [Eq. (5.37)] 
means that N{t,2) = Q{t^). 

6. OPTIMAL ANCILLA-ASSISTED QUANTUM PROCESS TOMOGRAPHY 

We now return to our immediate task of optimizing the measurements used for ancilla- 
assisted quantum process tomography. Throughout this section we assume that either ^ = 
Q(C'^ C^), ^^'^ or ^"'^. The results of Sec. 4 are first summarized for these specific cases. 

Recall that our error for tomographic reconstructions of quantum states was defined in 
terms of the Hilbert-Schmidt distance [Eq. (4.4)]: 

e(^'«)(p) := E \\\p-piy,,...,y^)r] , (6.1) 

yi,---,yN I J 

where p is the linear tomographic estimate of p given A^ measurement outcomes yi, . . . ,yN 
[Eqs. (4.2) and (4.3)]. When applied to the output states of quantum channels, with fixed 
input Pi = |/)(/|, this distance measure naturally induces the analogous Hilbert-Schmidt 
distance for superoperators (see the appendix): 

11^ -^11 = IIp-pII, (6.2) 

where p = {£®T){pi) and p = {£®X){p{) through the Jamiolkowski isomorphism [Eq. (2.3)], 
and \\S\\ := -^/Vt{S^S) for any superoperator S. Although there are more appropriate dis- 
tance measures for quantum channels, which properly reflect the probabilistic interpretation 
of a quantum state, the Hilbert-Schmidt distance is the most natural choice for linear to- 
mographic reconstructions of quantum states. 

Now setting p = p{a, U) := UaW for some fixed output state a E ^, recall that we 
defined the average [Eq. (4.16)] and worst-case [Eq. (4.32)] error over different Hilbert-space 
orientations Us, Us. G U((i) : 



ei^'«^(-) 


:= / / d/i(f/,)dp([/a) e(^'«)(p(a. Us (E) f/a)) ; 

J Jv{d) 


(6.3) 


e^If\cr) 


:= sup e(^'«)(p(a,f/s®f/a)). 

C/s,C/aGU(d) 


(6.4) 



Finally, recalling that the subsets JS^'^ and ^^^ respectively span 6' = d'^{d'^ — 1) + 1 and 
6' = {(P — 1)^ + 1 dimensions of H(C'^ (S> C*^), the following corollary restates Theorem 4.4 
and Corollary 4.5 for these special cases of interest. 

Corollary 6.1. Let F : QS(e^) -^ H(C'^(8>C'^) be a POVM which is informationally complete 
w.r.t. ^ C Q(C'^ C'^). Then for any fixed quantum state a E JS, 



( 



el^'^V) > ei^'«)(a) > < 



]-{d^^d^ -\- tr(a')) , z/ ^ = Q(C'^ ® C^) ; 

]- U" -d^ + l/d^ - tiia^)) , if ^ = ^^'; (6.5) 

— (d"^ -3d^ + 3- tT{a^)) , z/ ^ = ^^% 



N 
1 

N 
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for all reconstruction OVDs Q. Furthermore, equality in the RHS of Eq. (6.5) occurs if and 
only if Q = R and F is a tight rank-one POVM with supp(jF) = span(,^), in which case 
we also have equality in the LHS of Eq. (6.5). 

Tight rank-one POVMs thus describe the class of optimal measurements for linear ancilla- 
assisted quantum process tomography in both an average and worst-case sense. But do such 
measurements exist? When .^ = Q(C'^ ® C^), which is the class of output states of general 
quantum operations (and included in Corollary 6.1 for comparison), we recover the results 
of Ref. [5]. Here it was found that tight rank-one POVMs exist and are in fact equivalent 
to weighted complex projective 2-designs. 

Now consider the case .S = =S"'^. Our condition for a tight rank-one POVM [Eq. (4.28)] 
with D = d^,6= {d^ -1)^ + 1 and 

n^ = n^uc = |Ao® Ao)(Ao® Aol + V |Aj® Afc)(Aj® Afcl , (6.6) 



j,k>0 



becomes 



J^ = / dT{x) \P{x)){P{x)\ = |Ao®Ao)(Ao®Ao| + 3^5 — 7 52 |A, ® Afe)(A, ® A^l , (6.7) 



d^-1 ^ 

j,k>0 



using the orthonormal Hermitian operator basis {Afcj^^Q^ of Sec. 2 [see above Eq. (2.9)]. 
Equivalently, under the isomorphism |A)(-B| ^^ A(E) B'' we can rewrite this last form as 

dr(x) P{x) P{x) = (Ao ® Ao) ® (Aq ® Aq) + ;t^— - ^ (A^ ® A^) ® (A^ ® Afc) . (6.8) 

j,k>0 

Now multiplying on the left by P1432, taking the trace, and applying the easily confirmed 
identity tr[Pi432(A (g) B (g) C (g) D)] = tr(A) tr(C) tv{BD), we find that the types of tight 
rank-one POVMs corresponding to .^^^ must satisfy 

/ dr(x)tr,[{trjP(x)]}2] = d. (6.9) 

We know that trs[{tra[P]}2] < 1/rf, however, with equality only if P = \U){U\ [via Eq. (2.2)], 
a maximally entangled state. Thus, since the normalization J^ dr(x) = D = d^ must be 
adhered to, Eq. (6.9) can be satisfied only if P{x) = \U{x)){U{x)\, r-almost everywhere, for 
some function U : ^ ^' \J{d). 

We have established that all tight rank-one POVMs corresponding to JS'^'^ have POVDs in 
the form P(x) = \U{x)){U{x)\ where U : ^ ^ \J{d). It is thus natural to take JT C PU(rf) 
and let U{x) denote a representative from the equivalence class of unitaries x G Pl]{d) (as 
in Sec. 5). We will henceforth assume that this is the case. Now note that Tr(jF^) = 2 under 
Eq. (6.7). This means 

^ ff dT{x)dT{y)\tT[U{xyU{y)]\' = 2 , (6.10) 

given that \{P{x)\P{y))\'^ = \{U{x)\U{y))\^ = \tT[U{xyU{y)]\yd\ In particular, if ^ is a 
finite set, then by Theorem 5.4, ^ must be a weighted unitary 2-design with weight function 
w{x) = T{x)/d^. In fact. Theorem 5.4 could easily be extended to any subset ^ C PU((i) 
with the condition for equality in Eq. (5.5) (when t = 2) replaced by Eq. (6.11) in the 
following proposition (see e.g. Ref. [5, Theorem 6]). 
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Proposition 6.2. Let F : <B( JT) -^ H(C'^®C'^) be a POVM with supp(J^) = span(=g"'=) and 
assume ^ C PU((i). Then F is a tight rank-one POVM if and only if P{x) = \U{x)){U{x)\ 
with the outcome distribution {j^,T/d'^) satisfying 

^ j dT{x)U{xf^®{U{x)®^)^ = I d^i{x)U{x)®^®[U{x)®^)^ . (6.11) 

That is, if ^ is finite, then {^^r/d^) is a weighted unitary 2-design. 

Weighted unitary 2-designs tlius define tlie class of optimal measurements on the output 
state for linear ancilla-assisted process tomography of unital quantum channels. Proposi- 
tion 6.2 and Corollary 6.1 summarize this main result of the article. By Corollary 5.3, these 
measurements exist in all dimensions. One particularly interesting type is that specified by 
a complete set of MUUBs. This choice allows us to perform optimal ancilla-assisted process 
tomography through a series of orthogonal measurements on the output state. In all cases, 
the optimal reconstruction formula for the output state is [Eq. (4.29) with D = d^ and 
6={d^- 1)2 + 1] 

p = (d'-l) f dp{x) \U{x)){U{x)\ - (l - I) / , (6.12) 

where piles') = tr[F(<f)p] = f^dT{x){U{x)\p\U{x)) are the measurement outcome statistics. 
The corresponding unital channel follows from the Jamiolkowski isomorphism [Eq. (2.3)]: 

dp[x)U[x) QU[x)' - [d- 



E = {S -\) \ dp{x) U{x) U{xy - (d--)l. (6.13) 

J a- \ d/ 



Finally, consider the case ^ = ^^^. Our condition for a tight rank-one POVM [Eq. (4.28)] 
with D = d^, 5 = d'^{d'^ — 1) + 1 and Hjc- = H^gc now becomes 



J^ = I dT{x)\P{x)){P{x)\ = \\o^\o){\o^\o\ + ^y2\\j^\k){\j^\k\. (6.14) 



d^ ■ n 
k 



But following the exact same procedure as in the unital case we find that the types of tight 
rank-one POVMs corresponding to ^^^ must also satisfy Eq. (6.9), and thus, we again have 
P{x) = \U{x)){U{x)\, r-almost everywhere, for some function U : ^ ^ \J{d). In this case 
Tt{J-''^) = 2 — 1/d'^ under Eq. (6.14), however, which would violate Theorem 5.4. Our only 
conclusion can be that tight rank-one POVMs with supp(jF) = span(^sc^ ^^ ^^^ exist. The 
lower bound on the error rate [Eq. (6.5)] still applies, but it is unattainable. 



7. CONCLUSION 

In this article we have shown that weighted unitary 2-designs specify optimal measure- 
ments on the system-ancilla output state for ancilla-assisted process tomography of unital 
quantum channels (Corollary 6.1 and Proposition 6.2). Although existence is known in all 
dimensions (Corollary 5.3), it remains to construct specific examples of these designs with 
sizes as close as possible to the lower bound (Theorem 5.5). Complete sets of MUUBs are 
known in dimensions d = 2,3,5,7,11, and form unweighted unitary 2-designs with sizes 
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close to optimality. Each of these in fact specifies a minimal series of optimal orthogonal 
measurements (Theorem 5.6). Weighted unitary 2-designs of smaller size exist in dimension 
2, however (see Table I), and thus further reductions should be expected in higher dimen- 
sions. The optimization of the measurements used for ancilla-assisted process tomography 
of general quantum channels remains an open problem. 
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APPENDIX: QUANTUM OPERATIONS 

Before describing quantum operations let us take a moment to set notation. Following 
Caves [54] (see also Refs. [55, 56]) we write a linear operator A in vector notation as \A). 
The vector space of all such operators, End(C'^) = C^ , equipped with the Hilbert-Schmidt 
inner product {A\B) := tT{A^B), is a Hilbert space, where we think of {A\ as an operator 
"bra" and \B) as an operator "ket." Addition and scalar multiplication of operator kets 
then follows that for operators, e.g. a\A) + b\B) = \aA + bB). The usefulness of this 
notation becomes apparent when we consider linear maps on operators, i.e. superoperators. 
Given an orthonormal operator basis {i?fc}fe=i C End(C'^), {Ej\Ek) = 5jk, a superoperator 
S G End(End(C°')) = C^ may be written in two different ways: 

S = J2^JkE,&Ek^ = Y,Sjk\Ej)iEk\ (seC"'"'"). (A.l) 

The first representation illustrates the ordinary action of the superoperator, 

SiA) := J2^JkE,AEk^ , (A.2) 

3,k 

which amounts to inserting A into the location of the '0' symbol. The second reflects the 
left-right action, 

S\A) := Y,^Jk\Ej){Ek\A) = J^ ^jkEj ti (eM) , (A.3) 

j,k j,k 

where the superoperator acts on operators just like an operator on vectors. The identity 
superoperators relative to the ordinary and left-right actions are, respectively, I := iQl and 
I := J2k \Ek){Ek\. We also define Tr(5) := J2kiEk\S\Ek) and \\S\\ := ^y^T{S^S). The latter 
is the Frobenius norm of S induced by its left-right action. Here 5^ is the left-right adjoint, 
i.e. (yl|iS^|i?) := (i?|iS|A)*, and TZS denotes the left-right composition of two superoperators: 
{TZS)\A) := TZ\B) where |-B) = 5|A). Consult Refs. [54-56] for analogous concepts relative 
to the ordinary action. 

The particular choice of operator basis {Ek = Ek^k2 '■= \ki){k2\}ki fe2=i' where {\k)}f^^ is 
a fixed "standard" basis for C^, defines the so-called Jamiolkowski isomorphism [13]. The 
matrix s in Eq. (A.l), now called the process matrix, then satisfies 

■sjk = s,,,„k,k, = diinl ® {j2\)[iS(g)I)i\I){I\)]i\k,) ® \k2)) , (A.4) 
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where |/) := X]fc=i 1^) ® \k) /yd. Note that Tr(5'''5) = tr(s'''s) in general, and thus, the 
Hilbert-Schmidt superoperator distance between S and TZ can be rewritten as ||iS — TZ\\ = 
\\s — r\\, where r and 7?. are related through Eq. (A.l). The upshot of the current choice of 
operator basis is that when S and 7^ are quantum channels, as in this article, then s and r 
specify standard-basis matrix elements of output quantum states with fixed input |/)(/|. 

A superoperator S is called positive if it maps positive operators to positive operators 
under its ordinary action. If, in addition, for any auxiliary system C^"* we have {S^X) (A) > 
whenever A > 0, A G End(C'^ (8> C'^'"), then S is called completely positive. Alternatively, 
S is completely positive if and only if (A|i5|A) > for all A G End(C'^) [57]. That is, S 
is completely positive if and only if 5^ = 5 and S has nonnegative left-right eigenvalues. 
Diagonalizing, we see that S is completely positive if and only if it can be rewritten in an 
operator-sum form, called the Kraus representation [57-59]: 

S = Y,Bk&Bk^ = J2\Bk)iBk\ , (A.5) 

k k 

where the operators Bj. G End(C'^) are called Kraus operators. A superoperator S is said 
to be trace nonincreasing if tr[iS(A)] < tr(A) for all A, and moreover, trace preserving if 
tr[5(A)] = tT{A) for all A. Thus the Kraus operators together satisfy ^^^Bk^Bk < I when 
S is trace nonincreasing and ^^ B^ B^ = I when S is trace preserving. 

A quantum operation is a superoperator-valued measure ^[■] : ^(^) — > End(End(C'^)) 
on an outcome set ^ , which satisfies (1) S[J^] is completely positive and trace nonincreasing 
for all ^ G <8( JT) with ^[0] = 0, (2) ^[Ur=i -^k] = J2T=i£['^k] for any sequence of disjoint 
sets ^k € ^{^), and (3) S[^] is trace preserving. In this article we always take QS(<^) to 
be the Borel a-algebra. Quantum operations can be nonselective (e.g. channels), in which 
case there is only one output p' = S{p) := £[^] (p) for each input p, but are generally selective 
(e.g. measurements), in which case the output p' = S[y]{p)/p{y) occurs with probability 
p{ry) = tr(£^[=5^](p)). To make the connection to quantum measurements simply note that 

F( ■ ) := ^^ Ak{ ■ ) Ak{ ■ ) is the POVM describing the outcome statistics of the measuring 

instrument S[-] = Y.kM-) & M')^ ■ 
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